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Abstract 

Consider the random graph process where we start with an empty graph on n vertices, and at 
time t, are given an edge e± chosen uniformly at random among the edges which have not appeared 
so far. A classical result in random graph theory asserts that whp the graph becomes Hamiltonian 
at time (l/2 + o(l))nlogn. On the contrary, if all the edges were directed randomly, then the graph 
has a directed Hamilton cycle whp only at time (1 + o(l))nlogn. In this paper we further study 
the directed case, and ask whether it is essential to have twice as many edges compared to the 
undirected case. More precisely, we ask if at time t, instead of a random direction one is allowed 
to choose the orientation of et, then whether it is possible or not to make the resulting directed 
graph Hamiltonian at time earlier than nlogn. The main result of our paper answers this question 
in the strongest possible way, by asserting that one can orient the edges on-line so that whp, the 
resulting graph has a directed Hamilton cycle exactly at the time at which the underlying graph is 
Hamiltonian. 

1 Introduction 

The celebrated random graph process, introduced by Erdos and Renyi [II] in the 1960's, begins with 
an empty graph on n vertices, and in every round t = 1, . . . , m adds to the current graph a single new 
edge chosen uniformly at random out of all missing edges. This distribution is commonly denoted as 
G n , m . An equivalent "static" way of defining G n ^ m would be: choose m edges uniformly at random 
out of all (2) possible ones. One advantage in studying the random graph process, rather than the 
static model, is that it allows for a higher resolution analysis of the appearance of monotone graph 
properties (a graph property is monotone if it is closed under edge addition). 

A Hamilton cycle of a graph is a simple cycle that passes through every vertex of the graph, and a 
graph containing a Hamilton cycle is called Hamiltonian. Hamiltonicity is one of the most fundamental 
notions in graph theory, and has been intensively studied in various contexts, including random graphs. 
The earlier results on Hamiltonicity of random graphs were obtained by Posa |20| . and Korshunov |17| . 
Improving on these results, Komlos and Szemeredi [16] proved that if m! = \n log n + ^ log log n + c n n, 
then 

if c„, — > —00 
lim F(G n m i is Hamiltonian) — < e e if c n — > c 

1 if c n — > 00. 
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One obvious necessary condition for the graph to be Hamiltonian is for the minimum degree to be 
at least 2, and surprisingly, the probability of G n ^ m i having minimum degree two at time m! has 
the same asymptotic behavior as the probability of it being Hamiltonian. Bollobas [Zj strengthened 
this observation by proving that whp the random graph process becomes Hamiltonian when the last 
vertex of degree one disappears. Moreover, Bollobas, Fenner, and Frieze [8] described a polynomial 
time algorithm which whp finds a Hamilton cycle in random graphs. 

Hamiltonicity has been studied for directed graphs as well. Consider a random directed graph 
process where at time t a random directed edge is chosen uniformly at random among all missing edges, 
and let D n ^ m be the graph consisting of the first m edges. Frieze [13] proved that for m" = n log n+c n n, 
the probability of D n ,m" containing a (directed) Hamilton cycle is 



Similar to the undirected case, this probability has the same asymptotic behavior as the probability of 
the directed graph having minimum in-degree and out-degree 1. In fact, Frieze proved [14] that when 
the last vertex to have in- or out-degree less than one disappears, the graph has a Hamilton cycle whp. 

Hamiltonicity of various other random graph models has also been studied |21|, [3]. One model 
which will be of particular interest to us is the k-'m. /c-out model, in which every vertex chooses k 
in-neighbors and fc-out neighbors uniformly at random and independently of the others. Improving on 
several previous results, Cooper and Frieze [9] proved that a random graph in this model is Hamiltonian 
whp already when k = 2 (which is best possible since it is easy to see that a 1-in 1-out random graph 
is whp not Hamiltonian). 

1.1 Our Contribution 

Bollobas [7J, and Frieze's [H] results introduced above suggest that the main obstacle to Hamiltonicity 
of random graphs lies in "reaching" certain minimum degree conditions. It is therefore natural to ask 
how the thresholds change if we modify the random graph process so that we can somehow bypass 
this obstacle. 

We consider the following process suggested by Frieze [15] which has been designed for this purpose. 
Starting from the empty graph, at time t, an undirected edge (u, v) is given uniformly at random out 
of all missing edges, and a choice of its orientation (u — > v or v — > u) is to be made at the time of 
its arrival. In this process, one can attempt to accelerate the appearance of monotone directed graph 
properties, or delay them, by applying an appropriate on-line algorithm. It is important to stress 
that the process is on-line in nature, namely, one cannot see any future edges at the current round 
and is forced to make the choice based only on the edges seen so far. In this paper, we investigate 
the property of containing a directed Hamilton cycle by asking the question, "can one speed up the 
appearance of a directed Hamilton cycle?" . The best we can hope for is to obtain a directed Hamilton 
cycle at the time when the underlying graph has minimum degree 2. The following result asserts that 
directed Hamiltonicity is in fact achievable exactly at that time, and this answers the above question 
positively in the strongest possible way. 








— c 



if c n — > — oo 
if c n — > c 
if c n — > oo. 
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Theorem 1.1. Let Q be a random (undirected) graph process that terminates when the last vertex of 
degree one disappears. There exists an on-line algorithm Orient that orients the edges of Q, so that 
the resulting directed graph is Hamiltonian whp. 

Let us remark that Q whp contains (1 + o(l))n log n/2 edges, in contrast with (1 + o(l))nlogn 
edges in the random directed graph model. Thus the required number of random edges is reduced by 
half. 

Our model is similar in spirit to the so called Achlioptas process. It is well known that a giant 
connected component (i.e. a component of linear size) appears in the random graph G n ^ m when 
m = (1 + o(l))n/2. Inspired by the celebrated "power of two choices" result [2], Achlioptas posed 
the following question: Suppose that edges arrive in pairs, that is in round t the pair of edges (et, e' t ) 
chosen uniformly at random is given, and one is allowed to pick an edge out of it for the graph (the 
other edge will be discarded). Can one delay the appearance of the giant component? Bohman and 
Frieze answered this question positively [I] by describing an algorithm whose choice rule allows for 
the ratio m/n > 0.53, and this ratio has been improved since [5J. Quite a few papers have thereafter 
studied various related problems that arise in the above model [61 [131 IS (HI [23]. As an example, 
in [18], the authors studied the question, "How long can one delay the appearance of a certain fixed 
subgraph?" . 

One such paper which is closely related to our work is the recent work of Krivelevich, Lubetzky, and 
Sudakov [19] . They studied the Achlioptas process for Hamiltonicity, and proved that by exploiting 
the "power of two choices", one can construct a Hamilton cycle at time (1 + o(l))n log n/4, which is 
twice as fast as in the random case. Both our result and this result suggest that the "bottleneck" 
to Hamiltonicity of random graphs indeed lies in the minimum degree, and thus these results can be 
understood in the context of complementing the results of Bollobas [J], and Frieze [14] , 

I. 2 Preliminaries 

The paper is rather involved technically. One factor that contributes to this is the fact that we are 
establishing the "hitting time" version of the problem. That is, we determine the exact threshold for 
the appearance of a Hamilton cycle. The analysis can be simplified if one only wishes to estimate this 
threshold asymptotically (see concluding remarks). To make the current analysis more approachable 
without risking any significant change to the random model, we consider the following variant of the 
graph process, which we call the random edge process : at time t, an edge is given as an ordered pair 
of vertices et = (vt,wt) chosen uniformly at random, with repetition, from the set of all possible n 2 
ordered pairs (note that this model allows loops and repeated edges). In what follows, we use Gt to 
denote the graph induced by the first t edges, and given the orientation of each edge, use Dt to denote 
the directed graph induced by the first t edges. By m* we denote the time t when the last vertex of 
degree one in Gt becomes a degree two vertex. 

We will first prove that there exists an on-line algorithm Orient which whp orients the edges of the 
graph G mt so that the directed graph D mit is Hamiltonian, and then in Section [6] show how Theorem 

II. 11 can be recovered from this result. 
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I. 3 Organization of the Paper 

In the next section we describe the algorithm Orient that is used to prove Theorem 11.11 (in the 
modified model). Then in Section [3] we outline the proof of Theorem ll.il Section 0] describes several 
properties that a typical random edge process possesses. Using these properties we prove Theorem 

II. 11 in Section Then in Section [HJ we show how to modify the algorithm Orient, in order to make 
it work for the original random graph process. 

Notation. A directed 1-factor is a directed graph in which every vertex has in-degree and out-degree 
exactly 1, and a 1-factor of a directed graph is a spanning subgraph which is a directed 1-factor. The 
function exp(x) := e x is the exponential function. Throughout the paper log(-) denotes the natural 
logarithm. For the sake of clarity, we often omit floor and ceiling signs whenever these are not crucial 
and make no attempts to optimize our absolute constants. We also assume that the order n of all 
graphs tends to infinity and therefore is sufficiently large whenever necessary. 

2 The Orientation Rule 

In this section we describe the algorithm Orient. Its input is the edge process e = (ei, eg, . . . , e m „), 
and output is an on-line orientation of each edge et- The algorithm proceeds in two steps. In the 
first step, which consists of the first 2nloglogn edges, the algorithm builds a "core" which contains 
almost all the vertices, and whose edges are distributed (almost) like a 6-in 6-out random graph. In 
the second step, which contains all edges that follow, the remaining o(n) non-core vertices are taken 
care of, by being connected to the core in a way that will guarantee whp the existence of a directed 
Hamiltonian cycle. 

2.1 Step I 

Recall that each edge is given as an ordered pair (v,uu). For every vertex v we keep a count of the 
number of times that v appears as the first vertex. We update the set of saturated vertices, which 
consists of the vertices which appeared at least 12 times as the first vertex. Given the edge (v, w) at 
time t, if v is still not saturated, direct the edge (v, w) alternatingly with respect to v starting from 
an out edge (by alternatingly we mean, if the last edge having v as the first vertex was directed as 
an out edge of v, then direct the current one as an in edge of v, and vice- versa. For the first edge 
we choose arbitrarily the out direction). Otherwise, if v is saturated, then count the number of times 
that w appeared as a second vertex when the first vertex is already saturated, and direct the edges 
alternatingly according to this count with respect to w starting from an in edge. This alternation 
process is independent to the previous one. That is, even if w appeared as a first vertex somewhere 
before, the count should be kept track separately from it. 

For a vertex v € V, let the first vertex degree of v be the number of times that v appeared as a 
first vertex in Step I, and denote it as d\{v). Let the second vertex degree of v be the number of times 
that v appeared in Step I as a second vertex of an edge whose first vertex is already saturated, and 
denote it as dz^v). Note that the sum of the first vertex degree and second vertex degree of v is not 
necessarily equal to the degree of v in Step I as v might appear as a second vertex of an edge whose 
first vertex is not yet saturated. We will call such an edge a neglected edge of v. 
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2.2 Step II 



Let A be the set of saturated vertices at the end of Step I, and B = V \ A. Call an edge an A-B edge 
if one end point lies in A and the other end point lies in B, and similarly define A- A edges and B-B 
edges. Given an edge e = (v, w) at time t, if e is an A-B edge, and w.l.o.g. assume that v € B and 
w € A, then direct e alternatingly with respect to v, where the alternation process of Step II continues 
the one from Step I as follows: 

1. If v appeared as a first vertex in Step I at least once, then pick up where the alternation process 
of v as a first vertex in Step I stopped and continue the alternation. 

2. If v did not appear as a first vertex in Step I but did appear as a second vertex of an already 
saturated vertex, then pick up where the alternation process of v as a second vertex of a saturated 
vertex stopped in Step I and continue the alternation. 

3. If v appeared in Step I but does not belong to the above two cases, then consider the first 
neglected edge connected to v, and start the alternation process from the opposite direction of 
this edge. 

4. If none of the above, then start from an out edge. 

Otherwise, if e is an A- A edge or a B-B edge, orient it uniformly at random. Note that unlike Step I, 
the order of vertices of the given edge does not affect the orientation of the edge in Step II. 

For a vertex v € B, let the A-B degree of v be the number of A-B edges incident to v in Step II, 
and denote it as cIab(v)- For v G A, let (1ab{v) = 0. 

3 Proof Outline 

Our approach builds on Frieze's proof of the Hamiltonicity of the random directed graph process [H] 
with some additional ideas. His proof consists of two phases (the original proof consists of three phases, 
but for simplicity, we describe it as two phases). We shall first describe these two phases of Frieze's 
proof, and then point out the modifications that are necessary to accommodate our different setting. 
Let m = (1 + o(l))nlogn be the time at which the random directed graph process has minimum 
in-degree and out-degree 1, and let D n ^ m be the directed graph at time m (throughout this section we 
say that random directed graphs have certain properties if they have the properties whp). 

3.1 Phase 1 : Find a small 1-factor 

In Phase 1, a 1-factor of D n ^ m consisting of at most O(logra) cycles is constructed. To this end, a 
subgraph D^i n ^ out of D n ^ m is constructed which uses only a small number of the edges. Roughly 
speaking, for each vertex, use its first 5 out-neighbors and 5 in-neighbors (if possible) to construct 
—in,5—out- Note that the resulting graph will be similar to a random 5-in 5-out directed graph, but 
still different as some vertices will only have 1 in-neighbor and 1 out-neighbor even at time m. Finally, 
viewing Z?5_j n 5_ «t as a bipartite graph G'(V U V*,E'), where V* is a copy of V, and {u, v*} G E' 
iff u — > v belongs to D^,-i n ^^ ou t, one proves that G 1 has a perfect matching. It turns out that this 
matching can be viewed as a uniform random permutation of the set of vertices V. A well known fact 
about such permutations is that they whp consist of at most O(logn) cycles. 
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3.2 Phase 2 : Combining the cycles into a Hamilton cycle 

In Phase 2, the cycles of the 1-factor are combined into a Hamilton cycle. The technical issue to 
overcome in this step is the fact that in order to construct L>5_j nj 5_ ou t, all of the edges were scanned, 
and now supposedly we have no remaining random edges in the process to combine the cycles of the 
1-factor. However, note that since i^s-m.s-out consists of at most lOn edges, the majority of edges 
need not be exposed. More rigorously, let LARGE be the vertices whose degree is f2(logn/loglogn) 
at time to = 2nlogn/3 in the directed graph process. For the LARGE vertices, its 5 neighbors 
in I?5_j ni 5_ OMt will be determined solely by the edges up to time to, leaving the remaining edges 
(edges after time to) of the process unexposed. Two key properties used in Phase 2 are that whp, (a) 
\LARGE\ = n — o(n 1//2 ), and (b) every cycle of the 1-factor contains many LARGE vertices. Note that 
by (a), out of the remaining nlogn/3 edges, all but o(l)-fraction will connect two LARGE vertices. 
Phase 2 can now be summarized by the following theorem |14j . 

Theorem 3.1. Let V be a set of n vertices and L <ZV be a subset of size at least n — o(n 1//2 ). Assume 
that D is a directed 1-factor over V consisting of at most O(logn) cycles, and the vertices V \ L are 
at distance at least 10 away from each other in this graph. 

If (1/3 — o(l))nlogn L-L edges are given uniformly at random, then whp the union of these edges 
and the graph D contains a directed Hamilton cycle. 

The proof of a slightly stronger version of Theorem 13.11 will be given in Section [6l 

3.3 Comparing with our setting 

The main technical issue in this paper is to reprove Phase 1, namely, the existence of a 1-factor with 
small number of cycles. In |14| . the fact that all vertices have the same distribution in D§—i n §—outi 
led to an argument showing the existence of a matching that translates into a uniform random per- 
mutation. Our case is different because of the orientation rule. We have different types of vertices 
each being oriented in a different way, breaking the nice symmetry. The bulk of our technical work is 
spent in resolving this technical issue. 

Once this is done, that is after achieving the 1-factor, we come up with an analogue of LARGE, 
which we call "saturated". Similarly as in Phase 2 described above, we prove that whp (a') most 
of the vertices are saturated, and (&') every cycle in the 1-factor contains many saturated vertices. 
However, the naive approach results in a situation where one cannot apply Theorem 13.11 ((a') and 
(b') are quantitatively weaker than (a) and (6)). Thus we develop the argument of "compressing" 
vertices of a given cycle. This idea allows us to get rid of all the non-saturated vertices, leading to 
another graph which only has saturated vertices in it. Details will be given in Section 15.21 Once we 
apply the compression argument, we can use Theorem 13. II to finish the proof. Let us mention that the 
compression argument can be applied after Phase 1 in [T3] as well to simplify the proof. 

4 A Typical Random Process 

The following well-known concentration result (see, for example [lj Corollary A. 1.14]) will be used 
several times in the proof. We denote by Bi{n,p) the binomial random variable with parameters n 
and p. 
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Theorem 4.1. (Chernoff's inequality) If X ~ Bi(n,p) and e > 0, then 

F(\X-E[X]\ >eE[X\) < e -^( E W). 

4.1 Classifying Vertices 

To analyze the algorithm it will be convenient to work with three sets of vertices. The first is the set 
of saturated vertices at Step I. Throughout we will use A to denote this set. Let us now consider 
the non-saturated vertices B = V \ A. Here we distinguish between two types. We say that v £ B 
blossoms if there are at least 12 edges of the form {v, A} in Step II (by A we mean an arbitrary 
vertex from A), and let B\ be the collection of vertices which blossom. All the remaining vertices are 
restricted, and is denoted by B<i- Thus every vertex either is saturated (A), blossoms (B\), or is 
restricted (-82)- 

Furthermore, the set of restricted vertices has two important subclasses which are determined by 
the first vertex degree di(v), second vertex degree d%{v), and A-B degree (Iab(v) defined in the previous 
section. We say that a restricted vertex v partially-blossoms if the sum of its first vertex degree, 
second vertex degree, and A-B degree is at least 2. Note that since we stopped the process when the 
graph has minimum degree 2, every vertex v has degree at least 2. Thus, if the above mentioned sum 
is at most 1, then v either has a neglected edge, or a B-B edge connected to it. A useful fact that we 
prove in Lemma 1131 says that whp all such vertices v have one A-B edge (thus dAB{v) = 1), and at 
least one neglected edge. Thus, we call a restricted vertex v not being partially-blossomed, and having 
one A-B edge and at least one neglected edge as a bud. 



4.2 Properties of a Typical Random Process 

In this section we list several properties that hold whp for random edge processes. We will call an 
edge process typical if indeed the properties hold. Let 

mi = — n log n + — n log log n — n log log log n, 7712 = — n log n + — n log log n + n log log log n. 



Note that for a fixed vertex v, the probability of an edge being incident to v is 2n ^± l = - — (this 
is because in our process, each edge is given by an ordered pair of vertices). However as it turns out 
the small order term -4 is always negligible for our purpose, so we will use the probability for this 
event, and remind the reader that the term is omitted. Recall that the stopping time m* is the 
time at which the last vertex of degree one becomes a degree two vertex and the process stops. 

Claim 4.2. Let m* be the stopping time of the random process. Then whp 

Proof. For a fixed vertex v, the probability of an edge being incident to v is about ^. Hence the 
probability of v having degree at most 1 at time m,2 is, 

1 _ l\ m2 _|_ f m Al ■ (l- -V' 2 1 < 31ogn • e ~ log ^-l°s log ^-2 log log log n -q( 1 ^ _ 

n) \ 1 J n \ nj ~ " \n(log log n) 2 / 

Thus by Markov's inequality, whp there is no vertex of degree at most 1 after 777-2 edges. This shows 
that < ?7i2. Similarly, the expected number of vertices having degree at most 1 after seeing mi 
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edges is Q((log log n) 2 ), and by computing the second moment of the number of vertices having degree 
at most 1, we can show that after mi edges whp at least one such vertex exits. This shows that 
m* > mi. The rest of the details are fairly standard and are omitted. □ 

Next we are going to list some properties regarding the different types of vertices. 

Claim 4.3. The number of saturated vertices satisfies whp 

. (log log n) 12 
\A > n\ 1 - 



log 2 n 

Proof. For a fixed vertex v, the probability of v occurring as the first vertex of an edge is (exactly) ^, 
and thus the probability of v ending up non-saturated at Step I is at most 

g( * ){n)-V-n) ^^"'l^^H^H' 
The claim follows from Markov's inequality. □ 

Our next goal is to prove that the restricted vertices consist only of partially-blossomed and bud 
vertices. For that we need the following auxiliary lemma. 

Claim 4.4. Let Ebb be the collection of all B-B edges (in Step II). The graph G m , \ Ebb has whp 

minimum degree 2. 

Proof. If the graph Gr m , \ Ebb has minimum degree less than 2 for some edge process e, then there 
exists a vertex v which gets at most one edge other than a B-B edge, and at least one B-B edge. 
By Claim 14.21 it suffices to prove that the graph whp does not contain a vertex which has at most 
one edge other than a B-B edge at time mi, and at least one B-B edge at time mi- Let A v be the 
event that v is such vertex. Let BS be the event that \B\ < j~f~ — n {B is small), and note that 
F(BS) = 1 - o(l) by Claim [OJ Then we have 

P(G m . \ Ebb has minimum degree less than 2) = P I [J A v ) < n • P (A n BS) + o(l). (1) 

The event A is equivalent to the vertex v receiving k B-B edges, for some k > 0, and at most one 
edge other than a B-B edge at appropriate times. This event is contained in the event C v R T) v & where 
C„ is the event u v appears at most once in Step I", and T> v ^ is the event u cIab(v) < 1 by time mi and 
v receives k B-B edges by time 1712" . Therefore our next goal is to bound 

p(C„ n v v>k n BS) = F(c v n BS) ■ F(T> V)k \c v n BS) < p(c) • F(T> V)k \c v n BS). (2) 

We can bound the probability of the event C v by, 

To bound the event T> v ^ which is ll dAB(v) < 1 at time mi and v receives k B-B edges by time 
7712", note that C v and BS are events which depend only on the first 2 log log n edges (Step I edges). 
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Therefore conditioning on this event does not affect the distribution of edges in Step II (each edge is 
chosen uniformly at random among all possible n 2 pairs). We only consider the case (1ab(v) = 1 (the 
case cIab(v) = can be handled similarly, and turns out to be dominated by the case (1ab(v) = 1). 
Thus to bound the probability, we choose k + 1 edges among the m 2 — 2n log log n edges, let 1 of them 
to be an A-B edge, k of them to be B-B edges incident to v. Moreover, since <1ab{v) < 1 at time mi, 
we know that at least mi — 2n log logn — k — 1 edges are not incident to v. Thus, 

F{V v , k | c v n BS) 

'm 2 -2nloglogn\ f 2\ k+1 (k + 1\ \A\ f\B\\ k f _ 2 x ™i-2niogiog„-fc-i 



k + 1 J \n J \ljn\nj\ n 

By using the inequalities 1 — x < e~ x , \A\ < n, and ( m 2-2rUogiogn-j < m^ 1 , the probability above is 
bounded by 

(k + l)m% +1 (^\ O^f) exp ("n^ 1 ~ 2nl °g lo § n ~ k ~ • ( 4 ) 
Therefore by ©, ©, and ©, 

n v Vfk n BS) < 



O ( {k + l)m k 2 +1 ( -) k+1 ( MV exp (-l( mi - 2nloglogn - k - 1) 
V log n / \n J \ n J \ n 

Plugging the bound \B\ < 11 '° g *° g — - and m 2 < nlogn in the latter, one obtains: 

log n 

n(u\ f l °S^ogn\ /2(loglogn) 12 \ fc (2 

U\k) g exp (mi — 2nloglogn — k — 1) 

V log^n J \ logn J \ n 

By the definition m\ = ^nlogn + \n log logn — n log log logn, this further simplifies to 

/ (loglogn) 3 \ / 2e 2 /"(loglogn) 12 \ fc 
\ n J \ logn J 

Summing over all possible values of k, 



3 //m™-i™.^U2\ k 

o(n- u 



E„ „„. O(k) (log logn) 3 /4(loglogn) 12 \ 

fc =i ifcl n V logn / 

Going back to ([I]), we get that 

^{Gm, \ Ebb has minimum degree less than 2) = n ■ o(n _1 ) + o(l) = o(l). 

Note that as mentioned in the beginning of this section, we used - to estimate the probability of an 
edge being incident to a fixed vertex. This probability is in fact ^ — \, but the term \ will only 
affect the lower order estimates. □ 

Claim 4.5. Every restricted vertex is whp either partially-blossomed, or a bud. 
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Proof. Assume there exists a restricted vertex v which is not partially-blossomed or a bud. Then by 
definition, the sum d\{v)+d 2 {v)+dAB{v) < 1- The possible values of the degrees (di(v),d 2 (v),dAB(v)) 
are (1,0, 0), (0, 1, 0), (0, 0, 1), or (0, 0, 0). Vertices which correspond to (0, 0, 1) will all be bud vertices 
whp by Claim 14.41 It suffices to show then that whp there does not exist vertices which correspond 
to (1, 0, 0), (0, 1, 0), or (0,0,0). Let T be the collection of vertices which have d\(v) + d 2 (v) < 1 and 
dAB(v) = at time mi. By Claim l4~2l it suffices to prove that T is empty. Let BS be the event 
\B\ < (lo ^ o '°f^ )12 n, and note that by Claim H3J ¥(BS) = 1 - o(l). The event {T ^ 0} is the same as 
^vev{v € T}, and thus by the union bound, 

P(T ^ 0) < o(l) + ¥({v € T} n BS) 
veV 

= o(l) + ^ P ({di(u) + < 1} n {d AB (f ) = 0} n £<S) . 

By Bayes equation, the second term of right hand side splits into, 

J^P ({di(t>) + <k(v) < l}riBS)-F(d AB (v) = 0\{d 1 (v) + d 2 (v) < l}n BS) 

v&V 

<^2F(d 1 (v) + d 2 (v) < l)-F(d AB (v) = 0\{d 1 (v) + d 2 (v) < l}nBS). (5) 

v&V 

The probability P(di(v) + d 2 («) < 1) can be bounded by F({di(v) < 1} n {d 2 (t>) < 1}) which satisfies, 

P({di(t>) < 1} n {d 2 {v) < 1}) = P(di(u) < 1) • P(d 2 (u) < 1 1 < 1). 

The term P(di(u) < 1) can be easily calculated as, 

2nloglogn / 2nloglogn \ /i\ / ^ jynloglogn-1 ^ ^ flog l og n 



n y \ 1 J \n J \ n J \ log n 

To estimate P(d 2 (v) < l\di(v) < 1), expose the edges of Step I as follows: First expose all the first 
vertices. Then expose the second vertices whose first vertex is saturated (d 2 (v) is now determined for 
every v E V). The number of second- vertex-spots that are considered is at least 2nloglogn — 12n, 
and thus V(d 2 {v) < l\d\{v) < 1) is at most 

-1 \ 2n log log n— Yin /r>i 1 \ / -i \ / i \ 2n log log n— I2n— 1 /i i 

6 s /2nloglogn\ / 1_\ ^ _ 1A s 6 _ Q I log log n 

n/ \ 1 y \nj \ n) \ log 2 ra 

Thus as a crude bound, we have 

(log logn) 2 



P(di(u) + d 2 (u) < 1) < < 1) • P(d 2 (u) < 1 1 < 1) = O 



log n 



Since di(i>) + d 2 (v) < 1 implies that v € B, and dAs(^) depends only on the Step II edges (which are 
independent from d\(v), d 2 (v), and BS), the second term of the right hand side of equation ©, the 
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probability P (cIab(v) = | {di(v) + ^(f ) < 1} PI BS) can be bounded by 
/ i \A\\ mi ~ 2nl °s lo s n 

f 1 - 2— !— 1 I < exp (-2(mi - 2nloglogn)|A|/n 2 ) 

\ n n J v 

(log log n) 12 



< exp — (log n — 3 log log n — 2 log log log n) 1 5 — 

log n 



(log n) 3 (log log n)' 



n 



< exp (— log n + 3 log log n + 2 log log log n + o(l)) = O 
Therefore in ([5]), 

P(T / 0) < o(l) + V Q f Q f 0°g") 3 0°gl°g") 

^ V log 4 ™ / V " 

= o(l) + o(2?f^Wl). 



□ 



Claim 4.6. The following properties hold whp for restricted vertices: 

(i) There are at most log 13 re such vertices, 

(ii) every such two vertices are at distance at least 3 in G m „ from each other. 

Proof. Since being a restricted vertex is a monotone decreasing property, by Claim 14.21 it suffices to 
prove (i) at time mi. Recall that E>2 is the collection of restricted vertices (a vertex is restricted if it 
is not saturated or blossomed). 

First, condition on the whole outcome of Step I edges (first 2raloglogn edges) and the event that 



B\ < "tog^n — n ' Then the set B is determined, and for a vertex v € B, we can bound the probability 
)f the event v 6 B2 as following 

11 / m2 \ / 2\ l ( 2|^4j \ mi_2nloglosn_£ 



, ,n J \ n 



Use the inequalities mi = ^ralogn + log log n — log log log n < nlogn, 777,2 < nlogn, 1 — x < e x , 
and |j4| = n — \B\ > n ^1 — ^°^°f — 1 to bound the above by 

\r~^ / / (log log re) 12 \\ 

/ (2 log n) exp I —(log n — 3 log log n — 2 log log log n — £) I 1 5 ) I . 

^ V V log n ) ) 

The sum is dominated by I = 11, and this gives 

j . 

Thus the expected size of B2 given the Step I edges is 

E[\B 2 \ I Step I edges] < \B\ ■ O f O^}ognpo^n \ ^ 0((]og]ogn) i4 log ia n) 
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Since the assumptions on A and B holds whp by Claim H~3l we can use Markov inequality to conclude 
that whp there are at most log 13 re vertices in B<i- Let us now prove (ii). 

For three distinct vertices v±,V2 and w in V, let A(v±, V2, w) be the event that w is a common 
neighbor of v% and v%. The probability of there being edges (v±, w) (or (w, vi)) and (v2, w) (or (w, V2)) 
and vi,V2 G B2 can be bounded by first choosing two time slots where (t>i,u>) (or (w,vi)) and (v2,w) 
(or (io,t; 2 )) will be placed, and then filling in the remaining edges so that v±,V2 G B2. We will only 
bound the event of there being edges (vi,w) and (w, V2) in the edge process (other cases can be handled 
in a similar manner). The probability we would like to bound is 

P(3«i ! v 2 ,w,31 < ii,*2 < m 2 ,e tl = («i , w) , e t2 = (w,v 2 ),vi,v 2 G S 2 ). 

By the union bound this probability is at most 

m 2 

= ( w i>w),et 2 = (iu,v 2 ),vi,V2 G B 2 ) (7) 

wi,U2,ui€V t\,ti=\ 

= P(e tl = (vi,w),et 2 = (io,«2))P(«i,«2 G -B 2 |e tl = («i,io),e i2 = (w,v 2 )) 

<- P(«i,«2 G £ 2 |e tl = (ui,io),ei 2 = (w,v 2 )). (8) 

re z — ' 

tl,*2=l 

To simplify the notation we abbreviate P(«i,i>2 G -B2 |et 1 = et 2 = ^2)) by P(i>i,i>2 G 

-E^leii, et 2 ). By using the independence of Step I and Step II edges we have, 

F(vt,v 2 G B 2 \e tl ,e t2 ) = F(vi,v 2 G B\e tl ,e t2 )F(vi,v 2 <£ B 1 \vi,v 2 G B,e tl ,e t2 ). 

For fixed ii and i 2 , we can bound P(«i, i>2 G Sje^, et 2 ) by the probability of ll v\ and V2 appear at most 
22 times combined in Step I as a first vertex other than at time t\ and £2", whose probability can be 
bounded as follows regardless of the value of t\ and t2, 

— k — 

To bound P(«i, w 2 ^ t> 2 G U, e^, e^), it suffices to bound P(vi, U2 ^ -Bil^i, ^2 G i?, &t x , e^ 2 , £kS), 

which can be bounded by the probability of u v± and v 2 receives at most 22 A-B edges combined in 
Step II other than at time t\ and i 2 " . Regardless of the value of t\ and t2, this satisfies the bound, 

V ( m2 \ (-X ( - mi " 2 " 2nloglogn_z 
f^\£ )\n) \ 

Note that - and ^ in this equation should in fact involve some terms of order -3-, but we omitted it 
for simplicity since it does not affect the asymptotic final outcome. By a similar calculation to ©, 
this eventually can be bounded by 0( ° s 2 n ). Thus we have 

, log 26 n 
P(«i > «2GB 2 |e tlJ e ta ) = O l " 
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which by ([8]) and m 2 < nlogn gives, 



¥(3v 1 ,v 2 ,w,31 < h,t 2 < m 2 ,e tl = (vi,w),e t2 = (w,v 2 ),v 1 ,v 2 G B 2 ) < O 




n 



) 



Therefore by Markov's inequality, whp no such three vertices exist, which implies that two vertices 
vi,v 2 £ B 2 cannot be at distance two from each other in G m „. Similarly, we can prove that whp 
every two vertices v±,v 2 £ B 2 are not adjacent to each other, and hence whp every v\,v 2 € B 2 are at 



4.3 Configuration of the edge process 

To prove that our algorithm succeeds whp, we first reveal some pieces of information of the edge 
process, which we call the "configuration" of the process. These information will allow us to determine 
whether the underlying edge process is typical or not. Then in the next section, using the remaining 
randomness, we will construct a Hamilton cycle. 

In the beginning, rather than thinking of edges coming one by one, we regard our edge process 
e = (ei, e 2 , • • • , e m „) as a collection of edges for i = 1, • • • , m* whose both endpoints are not known. 
We can decide to reveal certain information as necessary. Let us first reveal the following. 

1. For t < 2nloglogn, reveal the first vertex of the i-th edge et- If this vertex already appeared 
as the first vertex at least 12 times among the edges ei, • • • , et-i, then also reveal the second 
vertex. 

Given this information, we can determine the saturated vertices, and hence we know the sets A 
and B. Therefore, it is possible to reveal the following information. 

2. For t > 2nloglogn, reveal all the vertices that belong to B. 

The information we revealed determines the blossomed (-Bi), and restricted (B 2 ) vertices. Thus 
we can further reveal the following information. 

3. For t < 2nloglogn, further reveal all the non-revealed vertices that belong to B 2 . 

4. For every edge et = (ft, wt) in which we already know that either vt € B 2 or wt £ B 2 , also reveal 
the other vertex. 

We define the configuration of an edge process as the above four pieces of information. 

We want to say that all the non-revealed vertices are uniformly distributed over certain sets. But 
in order for this to be true, we must make sure that the distribution of the non-revealed vertices is 
not affected by the fact that we know the value of m* (some vertex has degree exactly 2 at time to*, 
and maybe a non-revealed vertex will make this vertex to have degree 2 earlier than to*). This is 
indeed the case, since the last vertex to have degree 2 is necessarily a restricted vertex, and all the 
locations of the restricted vertices are revealed. Thus the non-revealed vertices cannot change the 
value of m*. Therefore, once we condition on the configuration of an edge process, the remaining 
vertices are distributed in the following way: 



distance at least two away from each other. 



□ 
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(i) For t < 2n log log n, if the first vertex of the edge et appeared at most 12 times among ei, • • ■ , et-i, 
then its second vertex is either a known vertex in B2 or is a random vertex in V \ Bi- 

(ii) For t > 2raloglogn, if both vertices of et are not revealed, then et consists of two random vertices 
of A. If only one of the vertices of et is not revealed, then the revealed vertex is in B, and the 
non-revealed vertex is a random vertex of A. 

Definition 4.7. A configuration of an edge process is typical if it satisfies the following. 

(i) The number of saturated and blossomed vertices satisfy \A\ > ft — - — j — § — - — Tij and \B\\ < 

^ lo ^°f — n respectively, 
(ii) The number of restricted vertices satisfies \B%\ < log 13 n. 

(Hi) Every vertex appears at least twice in the configuration even without considering the B-B edges. 

(iv) All the restricted vertices are either partially-blossomed or buds. 

(v) In the non-directed graph induced by the edges whose both endpoints are revealed, every two 
restricted vertices v\ , v% are at distance at least 3 away from each other. 

(vi) There are at least |nlogn edges et fort > 2nloglogn whose both endpoints are not yet revealed. 

Lemma 4.8. The random edge process has a typical configuration whp. 

Proof. The fact that the random edge process has whp a configuration satisfying (i),(iii), and (iv) 
follows from Claims l4~3l 14.41 and 14,51 respectively, (ii) and (v) follow from Claim [3~6l To verify (vi), 
note that by Claim H~2l and l4T3l whp there are at least log re — 2reloglogre edges of Step II, and 
\A\ = (1 — o(l))n. Therefore the probability of a Step II edge being an A- A edge is 1 — o(l), and the 
expected number of A- A edges is (1/2 — o(l))nlogn. Then by Chernoff's inequality, whp there are at 
least in log re ^4-^4 edges. These edges are the edges we are looking for in (vi). □ 



5 Finding a Hamilton Cycle 

In the previous section, we established several useful properties of the underlying graph G m , . In this 
section, we study the algorithm Orient using these properties, and prove that conditioned on the edge 
process having a typical configuration, the graph D mt whp contains a Hamilton cycle (recall that the 
graph D m „ is the set of random edges of the edge process, oriented according to Orient). As described 
in Section El the proof is a constructive proof, in the sense that we describe how to find such a cycle. 
The algorithm is similar to that used in [14] which we described in some details in Section [3] Let us 
briefly recall that it proceeds in two stages: 

1. Find a 1-factor of G. If it contains more than O(logn) cycles, fail. 

2. Join the cycles into a Hamilton cycle. 

The main challenge in our case is to prove that the first step of the algorithm does not fail. Afterwards, 
we argue why we can apply Frieze's results for the remaining step. 



5.1 Almost 5-in 5-out subgraph 

Let Z>5_j nj 5_ out be the following subgraph of Z? m „ . For each vertex v, assign a set of neighbors OUT(v) 
and IN(v), where OUT(v) are out-neighbors of v and IN(v) are in-neighbors of v. For saturated and 
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blossomed vertices, OUT(v) and IN(v) will be of size 5, and for restricted vertices, they will be of size 
1 (thus £> 5 

—in,5—out is not a 5-in 5-out directed graph under the strict definition). 
Let Ei be the edges of Step I (first 2nloglogn edges), and Ei be the edges of Step II (remaining 
edges) . 

• If v is saturated, then consider the first 12 appearances in E\ of v as a first vertex. Some of these 
edges might later be used as OUT or IN for other vertices. Hence among these 12 appearances, 
consider only those whose second vertex is not in B2 ■ By property (v) of Definition 14.71 there 
will be at least 11 such second vertices for a typical configuration. Define OUT{v) as the first 
5 vertices among them which were directed out from v, and IN(v) as the first 5 vertices among 
them which were directed in to v in Orient. 

• If v blossoms, then consider the first 10 A-B edges in E2 connected to v, and look at the other 
end points. Let OUT(v) be the first 5 vertices which are an out-neighbor of v and IN(v) be the 
first 5 vertices which are an in-neighbor of v. 

A partially-blossomed vertex, by definition, has di(v) + d,2{v) + cIab{v) > 2, and must fall into one 
of the following categories, (i) d\(v) > 2, (ii) ^(v) > 2, (iii) dAB{v) > 2, (iv) d±(v) = 1, dz(v) = 1, 
(v) d\(v) = 1, dAB{v) = 1, and (vi) d\(v) = 0, ^(w) = 1, g?ab( w ) = 1- If it falls into several categories, 
then pick the first one among them. 

• If v partially-blossoms and d\(v) > 2, consider the first two appearances of v in E\ as a first 
vertex. The first is an out-edge and the second is an in-edge (see Section 12. ip . 

• If v partially-blossoms and d2(v) > 2, consider the first two appearances of v in E\ as a second 
vertex whose first vertex is saturated. The first is an in-edge and the second is an out-edge (see 
Section EZQ. 

• Ifv partially-blossoms and dAs{v) > 2, consider the first two A-B edges in E2 incident to v. One 
of it is an out-edge and the other is an in-edge. Note that unlike other cases, the actual order of 
in-edge and out-edge will depend on the configuration. But since the configuration contains all 
the positions at which v appeared in the process, the choice of in-edge or out-edge only depends 
on the configuration and not on the non-revealed vertices (note that this is slightly different from 
the blossomed vertices). 

• If v partially-blossoms and d\(v) = 1, d2(v) = 1, consider the first appearance of v in E\ as a 
first vertex, and the first appearance of v in E% as a second vertex whose first vertex is saturated. 
The former is an out-edge and the latter is an in-edge. 

• If v partially-blossoms and d\(v) = 1, dAB(v) = 1, consider the first appearance of v in E\ as 
a first vertex, and the first A-B edge connected to v in i?2- The former is an out-edge and the 
latter is an in-edge (see rule 1 in Section 12.21) . 

• If v partially-blossoms and d\{v) = 0, ^(u) = 1, dAB{v) = 1, consider the first appearance of v 
in Ei as a second vertex whose first vertex is saturated, and the first A-B edge connected to v 
in E2. The former is an in-edge and the latter is an out-edge (see rule 2 in Section \2.2\\ . Thus 
we can construct OUT(v) and IN(v) of size 1 each, for all partially-blossomed vertices. 

• If v is a bud, then consider the first (and only) A-B edge connected to v. Let this edge be e s . 
For a typical configuration, by property (iii) of Definition 14.71 we know that v has a neglected 
edge connected to it. Let et be the first neglected edge of v. By property (v) of Definition 14. 7\ we 
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know that the first vertex of the neglected edge is either in A or B±. According to the direction 
of this edge, the direction of e s will be chosen as the opposite direction (see rule 3 in Section 
I2.2j) . As in the partially-blossomed case with cIab(v) > 2, the direction is solely determined by 
the configuration. Thus we can construct OUT(v) and IN(v) of size 1 each (which is already 
fixed once we fix the configuration). 

This in particular shows that D mt has minimum in-degree and out-degree at least 1, which is clearly 
a necessary condition for the graph to be Hamiltonian. A crucial observation is that, once we condition 
on the random edge process having a fixed typical configuration, we can determine exactly which edges 
are going to be used to construct the graph Ds-m^-out just by looking at the configuration. 

For a set X, let RV{X) be an element chosen independently and uniformly at random in the set 
(consider each appearance of RV(X) as a new independent copy). 

Proposition 5.1. Let V' = V \ B%. Conditioned on the edge process having a typical configuration, 
—in,5—out has the following distribution. 

(i) If v is saturated, then OUT(v) and IN(v) are a union of 5 copies of RV(V). 

(ii) If v blossoms, then OUT(v) and IN{v) are a union of 5 copies of RV(A). 

Proof. For a vertex v € V, the configuration contains the information of the time of arrival of the 
edges that will be used to construct the set OUT(v) and IN(v). 

If v is a saturated vertex, then we even know which edges belong to OUT(v) and IN(v) (if there 
are no E>2 vertices connected to the first 12 appearances of v as a first vertex, then the first five odd 
appearances of v as a first vertex will be used to construct OUT(v), and the first five even appearances 
of v as a first vertex will be used to construct IN(v)). Since the non-revealed vertices are independent 
random vertices in V', we know that OUT(v) and IN(v) of these vertices consist of 5 independent 
copies of RV(V'). 

If v blossoms, then the analysis is similar to that of the saturated vertices. However, even though 
the configuration contains the information of which 10 edges will be used to construct OUT(v) and 
IN(v), the decision of whether the odd edges or the even edges will be used to construct OUT{y) 
depends on the particular edge process (this is determined by the orientation rule at Step I). However, 
since the other endpoints are independent identically distributed random vertices in A, the distribution 
of OUT(y) and IN(v) is not be affected by the previous edges, and is always RV(A) (this is analogous 
to the fact that the distribution of the outcome of a coin flip does not depend on whether the initial 
position was head or tail). □ 

5.2 A small 1-factor 

The main result that we are going to prove in this section is summarized in the following proposition: 

Proposition 5.2. Conditioned on the random edge process having a typical configuration, there exists 
whp a 1-factor of Dh-in,h-mit containing at most 21ogn cycles, and in which at least 9/10 proportion 
of each cycle are saturated vertices. 

Throughout this section, rather than vaguely conditioning on the process having a typical con- 
figuration, we will consider a fixed typical configuration c and condition on the event that the edge 
process has configuration c. Proposition 15.21 easily follows once we prove that there exists a Hamilton 
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cycle whp under this assumption. The reason we do this more precise conditioning is to fix the sets 
A,B,Bi,E>2 and the edges incident to vertices of B2 (note that these are determined solely by the 
configuration). In our later analysis, it is crucial to have these fixed. 

To prove Proposition 15.21 we represent the graph D5— in,5-oiit as a certain bipartite graph in which 
a perfect matching corresponds to the desired 1-factor of the original graph D m * . Then using the edge 
distribution of D^i n ^_ out given in the previous section, we will show that the bipartite graph whp 
contains a perfect matching. The proof of Proposition 15.21 will be given at the end after a series of 
lemmas. 

Define a new vertex set V* = {v*\ v € V} as a copy of V, and for sets X C V, use X* to denote 
the set of vertices in V* corresponding to X. Then, in order to find a 1-factor in Z>5_j„ 5_ 0lt £, define an 
auxiliary bipartite graph BIP(V, V*) over the vertex set VUF* whose edges are given as following: for 
every (directed) edge (u,v) of D^^i n ^ out , add the (undirected) edge (u,v*) to BIP. Note that perfect 
matchings of BIP has a natural one-to-one correspondence with 1-factors of JD5— tn5— out- Moreover, 
the edge distribution of BIP easily follows from the edge distribution of -t>5-in,5-o«t- We will say that 
—in,5—out is the underlying directed graph of BIP. A permutation o~ of V* acts on BIP to construct 
another bipartite graph which has edges (v,a(w*)) for all edges (v,w*) in BIP. 

Our plan is to find a perfect matching which is (almost) a uniform random permutation, and show 
that this permutation has at most O(logn) cycles (if it were a uniform random permutation, then this 
is a well-known result, see, e.g., [12]). Since our distribution is not a uniform distribution, we will rely 
on the following lemma. Its proof is rather technical, and to avoid distraction, it will be given in the 
end of this subsection. 

Lemma 5.3. Let X be subset ofV. Assume that whp, (i) BIP contains a perfect matching, (ii) every 
cycle of the underlying directed graph Dx,_i n ^ out contains at least one element from X, and (Hi) the 
edge distribution of BIP is invariant under arbitrary permutations of X* . Then whp, there exists a 
perfect matching which when considered as a permutation contains at most 2 log n cycles. 

The next set of lemmas establish the fact that BIP satisfies all the conditions we need in order to 
apply Lemma 15. 31 First we prove that BIP contains a perfect matching. We use the following version 
of the well-known Hall's theorem (see, e.g., |10j). 

Theorem 5.4. Let V be a bipartite graph with vertex set XUY and \X\ = \Y\ = n. If for all X' C X 
of size \X'\ < n/2, \N(X')\ > \X'\ and for all Y' C Y of size \Y'\ < n/2, \N(Y')\ > \Y'\, then G 
contains a perfect matching. 

Lemma 5.5. The graph BIP contains a perfect matching whp. 

Proof. We will verify Hall's condition for the graph BIP to prove the existence of a perfect matching. 
Recall that BIP is a bipartite graph over the vertex set V L)V*. 

Let us show that every set D C V of size \D\ < n/2 satisfies \N(D)\ > \D\. This will be done 
in two steps. First, if D C B2, then this follows from the fact that OUT{v) are distinct sets for all 
v £ B2, (if they were not distinct, then there will be two restricted vertices which are at distance 2 
away, and it violates property (v) of Definition I4.7[) . Second, we prove that for D C V \ B2, 

I N(D) n (V* \ N(B 2 )) I > 

It is easy to see that the above two facts prove our claim. 



17 



Let D C V \ B 2 be a set of size at most k < n/2. The inequality |iV(D) n (V* \ iV(5 2 ))| < |-D| 
can happen only if there exists a set N* C V* \ N{B2) such that |iV*| < k, and for all v G D all 
the vertices of OUT(v) belong to N* U N(B2). Since D C V \ B2, every vertex in D has 5 random 
neighbors distributed uniformly over some set of size (1 — o(l))n, and thus the probability of the above 
event happening is at most, 



fn\ 2 ( \N{B 2 )\ + \N*\ \ bk / e 2 n 2 (log 13 n + fc) 5 \ fc / 9(log 13 nH 
V/fcJ V (l-o(l))n y - V k 2 • (1 - o(l))n 5 / ~V fc2 ™ 3 

For the range 9n/20 < k < n/2, we will use the following bound 



v /cy v( 1 -°( 1 )) n 

Summing over all choices of k we get 

n/2 



n\ / log n + k 



^ V(l-o(l)K 

<E(^)' + e ( 9(1 °CV t)5 )^ E 

^ V few / ^ V few / ^ 

fc=l V 7 fc=log 14 n V 7 fc=9n/20 

it 9n/20 



lof 14 

< 



E (— ^— J + E (—J +°d) = °(i) 

fc=l V 7 fc=log 14 n V 7 



This finishes the proof that w/ip \N(D)\ > \D\ for all D C of size at most n/2. Similarly, for 
sets D* C y* of size \D*\ < n/2, using the sets IN(v) instead of OUT(v) we can show that whp 
\N(D*)\ > \D*\ in BIP. □ 

For restricted vertices v, the sets OUT(v) and IN(v) are of size 1 and are already fixed since we 
fixed the configuration. Thus the edge corresponding to theses vertices will be in BIP. Let 

A = A\(U veB2 OUT(v)), 

and let A* be the corresponding set inside V* (note that A and A* are fixed sets). This set will be 
our set X when applying Lemma [5T31 We next prove that every cycle of L>5_j nj 5_ ou t contains vertices 
of A. 

Lemma 5.6. Whp, every cycle C of -Ds-m^-airf contains at least |~yg|C|] vertices of A. 

Proof. Recall that by Proposition ^. 11 for vertices v G V\B-2, the set OUT(v) and IN(v) are uniformly 
distributed over V \ B2, or A. Therefore, for a vertex w G B2, the only out-neighbor of w is OUT{w), 
and the only in-neighbor is IN(w) (note that they are both fixed since we fixed the configuration). 
Also note that, 

\V\A\<\V\A\ + \B 2 \ < + 2\B 2 \ < (1 ° gl °g w)12 n + 21og 13 n < % 



log n ' logn 
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We want to show that in the graph D^i n ^- out , whp every cycle of length k has at most k/10 
points from V \ A, for all k = 1, . . . , n. Let us compute the expected number of cycles for which this 
condition fails and show that it is o(l). First choose k vertices vi,V2, • • • , Ufe (with order) and assume 
that a of them are in Bi- Then since we already know the (unique) out-neighbor and in- neighbor for 
vertices in B2, for the vertices vx,- ■ ■ , to form a cycle in that order, we must fix 3a positions (a for 
the vertices in B2, and 2a for the in-, and out-neighbors of them by property (y) of Definition 14. 7j) . 
Assume that among the remaining k — 3a vertices, £ vertices belong to V \ (A U B%). Then for there 
to be at least [fc/10] vertices among x>\, ■ ■ ■ , Vf. not in A, we must have 3a + £ > \k/10] . There are at 
most 3 fc ways to assign one of the three types A, B2, and V \ (A U B2) to each of V\, ■ ■ ■ , v^. Therefore 
the number of ways to choose k vertices as above is at most 

3 fc • n k - f - 3a \V \ A\ e \B 2 \ a < 3 k ■ n k - l ^ a ( -^-Y (log 13 n) a 

\logny ' 

There are k — 2a random edges which has to be present in order to make the above k vertices 
into a cycle. For all i < k — 1, the pair (vi,Vi + i) can become an edge either by Vi+i € OUT(vi) or 
Vi € IN(vi + \) (and also for the pair (i>i,i>fc)). There are two ways to choose where the edge {vi,Vi + \\ 
comes from, and if both m and Vi+i are not in B2, then {vi, Vj+i} will become an edge with probability 
at most h_3i\\ n ■ Therefore the probability of a fixed vx, ■ ■ ■ ,Vk chosen as above being a cycle is at 

most 2 ( (i- (i)) n ) 1 an d the expected number of such cycles is at most 

2" ( 5 -—) ^ . 3 fc . n k - £ - 3a ( — V (log 13 n) a 
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where we used 3a + £ > \k/10] for the second inequality. Sum this over < £ < k and < a < k and 
we get 



" * fc / W% \ a / 40 y n (\^f h u f !°_ 




0(1) 



which proves our lemma. 

□ 

The following simple observation is the last ingredient of our proof. 
Lemma 5.7. The distribution of BIP is invariant under the action of an arbitrary permutation of 

A*. 

Proof. This lemma follows from the following three facts about the distribution of I?5_i nj 5_ on f . First, 
all the saturated vertices have the same distribution of IN. Second, for the vertices v 6 V \ B2, the 
distribution of OUT and IN is uniform over a set which contains all the saturated vertices (for some 
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vertices it is V \ B2, and for others it is A). Third, for the vertices v € B2, the set OUT(v) lies 
outside A by definition. Therefore, the action of an arbitrary permutation of A* does not affect the 
distribution of BIP. □ 



Note that here it is important that we fixed the configuration beforehand, as otherwise the set A* will 
vary, and a statement such as Lemma 15,71 will not make sense. 

By combining Lemmas 15. 3\ 15.51 15.61 and 15.71 we obtain Proposition 15.21 

Proof of Proposition \5.2[ Lemmas 15.51 15.61 and 15.71 show that the graph BIP has all the properties 
required for the application of Lemma 1531 (we use X = A). Thus we know that whp, D^i n ^- out has 
a 1-factor containing at most 21ogn cycles, and in which at least 9/10 proportion of each cycle are 
saturated vertices (second property by Lemma |5"U|) . □ 

We conclude this subsection with the proof of Lemma 15.31 

Proof of Lemma \5.3l For simplicity of notation, we use the notation B for the random bipartite graph 
BIP. Note that both a 1-factor over the vertex set V and a perfect matching of (V, V*), can be 
considered as a permutation of V. Throughout this proof we will not distinguish between these 
interpretations and treat 1-factors and perfect matchings also as permutations. 

First, let / be an arbitrary function which for every bipartite graph, outputs one fixed perfect 
matching in it. Then, given a bipartite graph T over the vertex set V U V* , let <3? be the random 
variable <£(r) := T f(rT), where r is a permutation of the vertices A* chosen uniformly at random. 
Since the distribution of B and the distribution of tB are the same by condition (Hi), for an arbitrary 
permutation a of A*, $ has the following property, 

P($(£) = </>) = Hr- l f(rB) = <f>) ( = } F^ra)' 1 f(raB) = 0) 

= ¥( T - l f(raB) = ac))) = P(t -1 /(tB) = <r<f>) = P(*(B) = a<f>). (9) 

In the (*) steps, we used (Hi), and the fact that if r is a uniform random permutation of A* , then so 
is to", and therefore, B,tB, and tuB all have identical distribution. 

Define a map II from the 1-factors over the vertex set V to the 1-factors over the vertex set A 
obtained by removing all the vertices that belong to V \A from every cycle. For example, a cycle of the 
form (xiX2yiy2X3i)3X4 : ) will become the cycle (X1X2X3X4) when mapped by II (where x±, ... ,x± e A, 
and 2/1, 2/2; 2/3 £ V\A). Note that if all the original 1-factors contained at least one element from A, 
then the total number of cycles does not change after applying the map II. This observation combined 
with condition (ii) implies that it suffices to obtain a bound on the number of cycles after applying II. 

Let a, p be permutations of the vertex set A* . We claim that for every 1-factor <fi of the vertex set 
V, the equality a ■ H(<ft) = Tl(a ■ 4>) holds. This claim together with ([9]) gives us, 

P(II($(B)) = p)= F(<f>(B) £ U-\p)) B P(<r$(B) e U~ 1 (p)) = F(U(a^(B)) = p) 
= F(a ■ II($(B)) = p)= P(n(*(B)) = a~ x p). 

Since a and p were an arbitrary permutation of the vertex set A, we can conclude that conditioned 
on there existing a perfect matching, n(<3?(B)) has a uniform distribution over the permutations of A. 
It is a well-known fact (see, e.g., [12]) that a uniformly random permutation over a set of size n has 
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whp at most 21ogn cycles. Since B whp contains a perfect matching by condition (i), it remains to 
verify the equality a ■ 11(0) = I1(cj ■ eft). Thus we conclude the proof by proving this claim. 

For a vertex x £ A, assume that the cycle of (ft which contains x is of the form (• • • xy\yi ■ ■ ■ Uk%+ • • • ) 
(k > 0) for yi, . . . , y^ E V\A. Then by definition 11(0) (a;) = x + , and thus (a-U(<ft))(x) = a{x + ). On the 
other hand, since a only permutes A and fixes every other element of V, we have (a-<f>){x) = <j(y\) = yi, 
and (a ■ 4>){y%) = y%+x for all z < k — 1, and (a ■ (ft)(yk) = cr(x+). Therefore the cycle in a ■ (ft 
which contains x will be of the form (• • • xy\y2 ■ ■ ■ yk<j(x+) ■ ■ ■ ) , and then by definition we have 
(IL(a ■ (ft))(x) = a(x + ). ' □ 

5.3 Combining the cycles into a Hamilton cycle 

Assume that as in the previous subsection, we started with a fixed typical configuration c, conditioned 
on the edge process having configuration c, and found a 1-factor of L>5_j nj 5_ out by using Proposition 
15.21 Since this 1-factor only uses the edges which have been used to construct the graph Z?5-tn,5-ouij 
it is independent of the A-A edges in Step II that we did not reveal. Moreover, by the definition of a 
typical configuration, there are at least |nlogn such edges. Note that the algorithm gives a random 
direction to these edges. So interpret this as receiving |nlogn randomly directed A-A edges with 
repeated edges allowed. Then the problem of finding a directed Hamilton cycle in D mit can be reduced 
to the following problem. 

Let V be a given set and A be a subset of size (1 — o(l))n. Assume that we are given a 1-factor 
over this vertex set, where at least 9/10 proportion of each cycle lies in the set A. If we are given 
log re additional A-A edges chosen uniformly at random, can we find a directed Hamilton cycle? 

To further simplify the problem, we remove the vertices V \ A out of the picture. Given a 1-factor 
over the vertex set V, mark in red, all the vertices not in A. Pick any red vertex v, and assume 
that V-,v, w+ € V appear in this order in some cycle of the given 1-factor. If v- ^ v+, replace the 
three vertices V-,v,v+ by a new vertex v', where v' takes as in-neighbors the in-neighbors of u_, and 
as out-neighbors, the out-neighbors of v+. We call the above process as a compression of the three 
vertices v—,v,v+. A crucial property of compression is that every 1-factor of the compressed graph 
corresponds to a 1-factor in the original graph (with the same number of cycles). Since a directed 
Hamilton cycle is also a 1-factor, if we can find a Hamilton cycle in the compressed graph, then we 
can also find one in the original graph. 

Now for each v £ V \ A, compress the three vertices v_,v,v + into a vertex v' and mark it red if 
and only if either V- or v + is a red vertex. This process always decreases the number of red vertices. 
Repeat it until there are no red vertices remaining, or u_ = v + for all red vertices v. As long as there is 
no red vertex in a cycle of length 2 at any point of the process, the latter will not happen. Consider a 
cycle whose length was k at the beginning. Since at least 9/10 proportion of each cycle comes from A 
and every compression decreases the number of vertices by 2, at any time there will be at least (8/10)fc 
non-red vertices, and at most (1/10) A: red vertices remaining in the cycle. Thus if a cycle has a red 
vertex, then its length will be at least 9, and this prevents length 2 red cycles. So the compressing 
procedure will be over when all the red vertices disappear. Note that since |V \ A\ = \B\ = o(n), the 
number of remaining vertices after the compression procedure is over is at least n — 2\B\ = (1 — o(l))n. 
As mentioned above, it suffices to find a Hamilton cycle in the graph after the compression process is 
over. 
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Another important property of this procedure is related to the additional A-A edges that we are 
given. Assume that v is the first red vertex that we have compressed, where the vertices V-,v,v+ 
appeared in this order in some 1-factor. Further assume that v— and w+ are not red vertices. Then 
since the new vertex v' obtained from the compression will take as out-neighbors the out-neighbors of 
t>+, and in-neighbors the in- neighbors of V-, we may assume that this vertex v' is a vertex in A from 
the perspective of the new |nlogn edges that will be given. 

This observation shows that every pair of vertices of the compressed graph has the same probability 
of being one of the new ^nlogn edges. Since the number of vertices reduced by o(n), only o(n log n) 
of the new edges will be lost because of the compression. Thus whp we will be given — o(l))nlogn 
new uniform random edges of the compressed graph. 

Theorem 5.8. For a typical configuration c, conditioned on the random edge process having configu- 
ration c, the directed graph -D m „ whp contains a Hamilton cycle. 

Proof. By Proposition l5.2l there exists whp a perfect matching of BIP which corresponds to a 1-factor 
in D mt consisting of at most 21ogn cycles. Also, at least 9/10 proportion of the vertices in each cycle 
lies in A. After using the compression argument which has been discussed above, we may assume 
that we are given a 1-factor over some vertex set of size (1 — o(l))n. Moreover, the random edge 
process contains at least (| — o(l))nlogn additional random directed edges (distributed uniformly 
over that set). By Theorem 13.11 with L being the whole vertex set, we can conclude that whp the 
compressed graph contains a directed Hamilton cycle, and this in turn implies that D mt contains a 
directed Hamilton cycle. □ 

Corollary 5.9. The directed graph D m * whp contains a Hamilton cycle. 

Proof. Let e be a random edge process. Let D = D mt (e) and HAM. be the collection of directed 
graphs that contain a directed Hamilton cycle. For a configuration c, denote by e £ c, the event that 
e has configuration c. If e G c for some typical configuration c, then we say that e is typical. 

By Theorem 15.81 we know that for any typical configuration c, ¥(D ^ HAM\e £ c) = o(l), from 
which we know that ¥({D (jz HAM} n {e is typical}) = o(l). On the other hand, by Lemma 14.81 we 
know that the probability of an edge process having a non-typical configuration is o(l). Therefore 
whp, the directed graph D is Hamiltonian □ 

6 Going back to the original process 

Recall that the distribution of the random edge process is slightly different from that of the random 
graph process since it allows repeated edges and loops. In fact, one can show that at time m*, the 
edge process whp contains at least f2(log 2 n) repeated edges. Therefore, we cannot simply condition 
on the event that the edge process does not contain any repeated edges or loops to obtain our main 
theorem for random graph processes. Our next theorem shows that there exists an on-line algorithm 
OrientPrime which successfully orients the edges of the random graph process. 

Theorem 6.1. There exists a randomized on-line algorithm OrientPrime which orients the edges 
of the random graph process, so that the resulting directed graph is Hamiltonian whp at the time at 
which the underlying graph has minimum degree 2. 
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The algorithm OrientPrime will mainly follow Orient but with a slight modification. Assume 
that we are given a random graph process (call it the underlying process). Using this random graph 
process, we want to construct an auxiliary process whose distribution is identical to the random edge 
process. Let t = 1 at the beginning and at be the number of distinct edges up to time t in our auxiliary 
process (disregarding loops). Thus a\ = 0. At time t, with probability (2at + n)/n 2 we will produce 
a redundant edge, and with probability 1 — (2a< + n)/n 2 , we will receive an edge from the underlying 
random graph process. Once we decided to produce a redundant edge, with probability 2at/(2at + n) 
choose uniformly at random an edge out of the at edges that already appeared, and with probability 
n/(2af + n) choose uniformly at random a loop. Let e t be the edge produced at time t (it is either a 
redundant edge, or an edge from the underlying process), and choose its first vertex and second vertex 
uniformly at random. One can easily check that the process (e\, e%, ■ ■ ■ , ) has the same distribution 
as the random edge process. 

In the algorithm OrientPrime, we feed this new auxiliary process into the algorithm Orient and 
orient the edges accordingly. Since the distribution of the auxiliary process is the same as that of 
the random edge process, Orient will give an orientation which whp contains a directed Hamilton 
cycle. However, what we seek for is a Hamilton cycle with no redundant edge. Thus in the edge 
process, whenever we see a redundant edge that is a repeated edge (not a loop), color it by blue. In 
order to show that OrientPrime gives a Hamiltonian graph whp, it suffices to show that we can 
find a Hamilton cycle in D mt which does not contain a blue edge (note that loops cannot be used in 
constructing a Hamilton cycle). We first state two useful facts. 

Claim 6.2. Whp, there are no blue edges incident to B used in constructing D§—i n §— ou t. 

Proof. The expected number of blue edges incident to B in Step I used in constructing D^-i n ^- ou t can 
be computed by choosing two vertices v and w and then computing the probability that v £ B, and 
(v, w) or (w, v) together appears twice among Step I edges. The probability that v appears as a first 
vertex exactly i times is ^ nlo ^°s n ^ (I^ 1 (i _ _^ nk, g lo g™ l _ Condition on the event that v appeared 
i times as a first vertex for some i < 12 (and also reveal the i positions in which v appeared). We 
then compute the probability that some two Step I edges are (v,w) or (w,v). There are three events 
that we need to consider. First is the event that (v,w) appears twice, whose probability is Q) (-) • 
Second is the event that (v, w) appears once and (w, v) appears once, whose probability is at most 
^•niogkign^ 1 • (i)^- Third is the event that (w,v) appears twice, whose probability is at most 

^niogk>gn-j ^_J__^ _ Combining everything, we see that the expected number of Step I blue edges 
incident to B is at most, 
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The main term comes from i = 11, and the third term in the final bracket. Consequently, we can 
bound the expectation by 

+ ( nhS ^") (I)" (l - ]\°'° t '°"'~ U . /nloglognN / 1 V = o(1) . 
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We then would like to compute the expected number of blue edges incident to B in Step 2 used in 
constructing D^i n ^ out . Condition on the first vertices of the Step I edges so that we can determine 
the sets A and B. By Claim |4TB"1 we may condition on the event \B\ = 0{ ^°^°f ~^ — )• Fix a vertex 
v € B, and expose all appearances of v in Step II, and note that only the first 10 appearances are 
relevant. By Claim 14.41 it suffices to bound the probability of the event that there exists a vertex 
w € A such that (v, w) or (w, v) appears twice among the at most 24 Step I edges where v or w are 
the first vertices, and the at most 10 Step II edges which we know is going to be used to construct the 
OUT and IN of the vertex v. Therefore the expectation is 

|B| .„7MV = f(!2ll2|^„A.fMV = 0(1) . 



n J V log n ) V n 

□ 

Claim 6.3. Whp, there are at most logn blue edges used in constructing Dh-in,5-mit- 

Proof. By Claim [6T2| we know that whp, all the blue edges used in constructing i^s-j^s-out are incident 
to A. Therefore it suffices to show that there are at most logn blue edges among the Step I edges. 
The expected number of such edges can be computed by choosing two vertices v, w, and computing 
the probability that (v, w) or (w, v) appears twice. Thus is at most 

2 AiloglogrA / 2 \ 2 

n ■{ 2 JUv =°( l0 ^)- 

Consequently, by Markov's inequality, we can derive the conclusion. □ 
Claim 6.4. Whp, each vertex is incident to at most one blue edge. 

Proof. It suffices to show that there does not exist three distinct vertices v, W\,W2 such that both 
{v,w\} and {v,W2} appear at least twice. The probability of this event is at most 

n\ ( m2\ /4\ / 2 x 
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Now assume that we found a 1-factor as in Section 15.21 By Claim 16.31 whp, it contains at most 
logn blue edges. Then after performing the compression process given in the beginning of Section 
15.31 by Claim 16.21 the number of blue edges remains the same as before. Therefore, if we can find a 
Hamilton cycle in the compressed graph which does not use any of the blue edges, then the original 
graph will also have a Hamilton cycle with no blue edges. Thus our goal now is to combine the cycles 
into a Hamilton cycle without any blue edges, by using the non-revealed ^4-^4 edges. 

In order to do this, we provide a proof of a slightly stronger form of Theorem 13.11 for L = V . 
In fact, it can be seen that when combined with the compression argument, this special case of the 
theorem implies the theorem for general L. Note that we have at least ral g gn non-revealed ^4-^4 edges 
remaining after finding the 1-factor described in the previous paragraph. Note that these edges cannot 
create more blue edges in the 1-factor we previously found, since all the ^4-^4 edges used so far appears 
earlier in the process than these non-revealed edges. We will find a Hamilton cycle in two more phases. 
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The strategy of our proof comes from that of Frieze [13] • In the first phase, given a 1-factor consisting 
of at most O(logn) cycles, we use the first half of the remaining non-revealed A- A edges to combine 
some of the cycles into a cycle of length n — o(n). In this phase, we repeatedly combine two cycles of 
the 1-factor until there exists a cycle of length n — o{n). 

Lemma 6.5. Whp, there exists a 1-factor consisting of O (log n) cycles, one of which is of length 
n — o{n). Moreover, this 1-factor contains at most O(logn) blue edges. 

Proof. Condition on the conclusion of Claim 16.31 Then we are given a 1-factor consisting of at most 
clog n cycles and containing at most log n blue edges. Our goal is to modify this 1-factor into a 1-factor 
satisfying the properties as in the statement. Consider the non-revealed random A-A edges we are 
given. Since we will use only the first half of these edges, we have at least wl ° g " random A-A edges 
given uniformly among all choices. Let En be these edges. Partition En as Eq U E\ U • • • U E c \ ogn , 
where Eq is the first half of edges, E\ is the next 2c iogn P ro P or ti° n of edges, Ei is the next 2c iogn 
proportion of edges, and so on. Thus \Eq\ = \\En\ and \E\\ = ■■■ = |-E c i ogn | = 2e iogn \En\- Since 

se tl 

log n \ ri log 1 / 2 



\Eo\ > " *°2 ~ i by applying Chernoff's inequality and taking the union bound, we can see that whp, for 
every set of vertices X of size at least \X\ > w " 2 , there exists at least \ X\ 1 ^^ > — |g 



log 1 

edges of Eq between X and V \ X. Condition on this event. 

Assume that the 1-factor currently does not contain a cycle of length at least n — t i/ 2 ~ • Then 
we can partition the cycles into two sets so that the number of vertices in the cycles belonging to each 
part is between - — and n — - — V^—. Thus by the observation above, there exist at least n °f B — - 

r log 1/2 n log 1/2 n J 4 ° 

edges of Eq between the two parts. Let (y, w) be one such edge. Let v + be the vertex that succeeds v 
in the cycle of the 1-factor that contains v, and let w~ be the vertex that precedes w in the cycle of 
the 1-factor that contains w. If (w~,v + ) € E\, then the cycle containing v and the cycle containing 
w can be combined into one cycle (see Figure [1]). Therefore, each edge in Eq gives rise to some pair e 
for which if e 6 E±, then some two cycles of the current 1-factor can be combined into another cycle. 
The probability of no such edge being present in E\ is at most 

/_J_ relog 1/2 n \ lEl1 -(log 1 / 2 n/(48n)).(|^|/(2clogn)) < _ n(log l/2 n) 

^ n 2 ' 48 j _e ' ~ 6 

Therefore with probability 1 — e _0 ( logl/2 n ) , we can find an edge in Eq and an edge in E\ which together 
will reduce the total number of cycles in the 1-factor by one. 

We can repeat the above using E{ instead of E\ in the i-th step. Since the total number of cycles in 
the initial 1-factor is at most clogn, the process must terminate before we run out of edges. Therefore 
at some step, we must have found a 1-factor that has at most O(logn) cycles, and contains a cycle of 
length n — o(n). It suffices to check that the estimate on the number of blue edges hold. Indeed, every 
time we combine two cycles, we use two additional edges which are not in the 1-factor, and therefore 
by the time we are done, we would have added O(logn) edges to the initial 1-factor. Therefore even 
if all these edges were blue edges, we have O(logra) blue edges in the 1-factor in the end. □ 

Consider a 1-factor given by the previous lemma. In the second phase, we use the other half of the 
remaining new random edges to prove that the long cycle we just found, can "absorb" the remaining 
cycles. Let P = (vq, ■ ■ ■ ,vi) be a path of a digraph. If there exist two edges (^,^+1) and (vi,Vj) for 
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1 < i < £ and i + 1 < j < £, then we can rotate the path P using Vi and Vj—i as breaking points to 
obtain a new path (vq,vi, • • • ,Vi, Vj,Vj + i, ■ ■ ■ , V£,Vi + \,Vi + 2, • • • , Vj-i) (see Figure HJ. We call the 
intermediate point of this rotation. Note that if the graph contains the edge (vj-i,vo), then one can 
close the path into a cycle. Our strategy is to repeatedly rotate the given path until one can find such 
an edge and close the path (see Figure [TJ . 

Further note that the path obtained from P by rotating it once as above can be described as 
following. Let Pi,P2,P3 be subpaths of P obtained by removing the edges (vi,Vi + \) and (vj-\,Vj). 
Then there exists a permutation ir of the set [3] such that the new path is the path obtained by 
concatenating P n m , P-k{%) > Arte) (i n order). More generally, assume that we rotate the path P in 
total s times by using distinct breaking points v ai ,v a2 , ■ ■ ■ ,v a2g . Let Pi,-- - ,-P2s+i be the subpaths 
of P obtained by removing the edges {v aj ,v a .+i) for 1 < j < 2s. Then there exists a permutation 
a of the set [2s + 1] such that the path we have in the end is the path obtained by concatenating 
P a {X) > P(t(2) > ' " ) P(j(2s+i)- We will use this fact later. Note that it is crucial to have distinct breaking 
points here. 

After finding a 1-factor described in Lemma |6.5| there are at least wl ° g " non-revealed A- A edges 
that we can use. Let El be the later nlo & n Q f these edges, and reveal all the non-revealed edges not 
in El- Note that there exists a positive constant C such that whp, the graph induced by the revealed 
edges before beginning this phase has maximum degree at most Clogn (it follows from Chernoff's 
inequality and union bound). Condition on this event. 

We will use the remaining edges El in a slightly different way from how we did in the previous 
phase since in this phase, it will be more important to know if some certain edge is present among the 
non-revealed edges. For an ordered pair of vertices e = (x, y), let the flip of e be r(e) = (y, x) (similarly 
define a flip of some set of pairs). Fix some pair e = (x,y), and suppose that we are interested in 
knowing whether e G El holds or not, and if e G El, then whether it is a blue edge or not. Thus 
for each of the non-revealed edge in El, ask if it is e or r(e). Since we know how many times e and 
r(e) appeared among the already revealed edges, in the end, we not only know if e G El, but also 
know if if it is a blue edge or not. We call this procedure as exposing the pair e, and say that e has 
been exposed. Note that the process of exposing the pair e is symmetric in the sense that even if we 
are looking only for the edge e we seek for the existence of r(e) as well. This is because we would 




Figure 1: Combining two cycles, and rotating a path. 
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like to determine whether e is blue or not at the same time. We can similarly define the procedure of 
exposing a set of pairs, instead of a single pair. We would like to carefully expose the edges in order 
to construct a Hamilton cycle without blue edges. 

Note that the expected number of times that e or r(e) appears in El is -% • = Thus if 

S is the set of exposed pairs at some point, we say that the outcome is typical if the number of times 
that a pair belonging to S appears in El is at most ^ 1 ° gn (which is three times its expected value). 
While exposing sets of pairs, we will maintain the outcome to be typical, since we would like to know 
that there are enough non-revealed pairs remaining in El- For a set X of vertices, let Q{X) be the 
set of ordered pairs {x\,x%) such that x% G X or X2 € X. 

Lemma 6.6. Let X and Y be sets of vertices of size at most ||. Assume that the set of exposed pairs 
so far is a subset of Q(X) and the outcome is typical. Further assume that a path P from vq to vi of 
length t = n — o(n) is given for some Vq, V£ ^ X U Y. 

Then there exists a set Z C V(P) disjoint from Y of size at most \Z\ < i ogn ,io g i ogn such that with 
probability at least 1 — o((logn) _1 ), by further exposing only pairs that intersect Z (thus a subset of 
Q(Z) ), one can find a cycle over the vertices of P. Furthermore, the outcome of exposing these pairs 
is typical and no new blue edges are added (thus the set of blue edges in the cycle is a subset of the set 
of blue edges in P). 

Informally, Y is the set of 'reserved' vertices which we would like to keep non-exposed for later 
usage. The lemma asserts that we can close the given path into a cycle by further exposing pairs that 
intersect some set Z which is disjoint from Y and has relatively small cardinality. 

Proof. Denote the path as P = (vq, vx, ■ ■ ■ ,ve). For a subset of vertices A = {v ai ,v a2 , ••• ,v at }, define 
A~ = {v ai -±, i>a 2 -i> ' ' ' > w a t -i} and A + = {v ai+ \, v a2+ i, ■ ■ ■ ,v at+ \} (if the index reaches either —1 or 
i + 1, then we remove the corresponding vertex from the set). 

Our strategy can be described as following. We repeatedly rotate the path to obtain endpoints, 
and in each iteration select a set of vertices and expose only pairs incident to these vertices (call 
these vertices as the involved vertices). Thus a pair consisting of two non- involved vertices will remain 
non-exposed. The set Z will be the set of involved vertices, and our goal will be to construct a cycle 
while maintaining Z to be small. 

To keep track of the set of vertices that have been involved and the set of endpoints that we 
obtained, we maintain two sets Tj and S 1 , for i > 0, where To = {v^} and Sq = X. Informally, Tj will 
be the set of endpoints that have not yet been involved, and Si will be the set of involved vertices 
while obtaining the set Tj. For example, suppose that we performed a rotation as in Figure [Din the 
first round. We will later see that in the process, we expose the neighbors of vg and V{ for this round 
of rotation to obtain a new endpoint Vj-\. Thus we will add the vertices V£ and V{ to S± and to 
T\. It is crucial to maintain T{ as a subset of the set of non-involved vertices, since we will need to 
expose its neighbors in the next round of rotation. 

Let Yq = Y L){vq}. Throughout the rotation process, Tj and Si will satisfy the following properties: 

(i) for every w € Tj , there exists a path of length I from to w whose set of blue edges is a subset 
of that of P, 

(ii) the set of exposed pairs after the i-th step is a subset of Q(Si), 

(iii) all the breaking points used in constructing the paths above belong to Si U Tj, 
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(iv) \T>\ = 1 and \ Si \ S^\ < 2 * = g|T| (for i > 1), 

(v) XUTi-iUSt-! C 5 i? 

(vi) Sj, Tj, and Yo are mutually disjoint, and 

(vii) the outcome at each iteration is typical. 

Recall that To = {v{\ and So = X, and note that the properties above indeed hold for these sets. 
Since So = X, property (iv) in particular implies that 

N < X + V- T a < X + Ti . 

log n log n 



Suppose that we completed constructing the sets Tj and Sj for some index i so that |Tj| < ( logn ^ log logw 

' log n 
log log n 



By (iv), we have | Si | < |X| + ^g^|Ti| < (JL+ (l))n and i = Q( w°f" )■ We will show how to construct 



the sets Tj + i and Sj+i from these sets. 

By \X\ < (ii), (iv) and (vii), we know that at any step of the process the number of edges in 
El that remain non-revealed is at least 

_ \Q(Sj) \ ■ logn nlogn _ 2[Sj|nlogre relogra 
1 L| n " 6 n " 12 ' 

Moreover, the number of non-exposed pairs remaining is at least 

n 2 -\Q(Si)\ >n 2 -2n\Si\ > y. 
We will make use of the following three claims whose proof will be given later. 

Claim 6.7. Assume that some pairs have been exposed and the outcome is typical. Once we expose 
the remaining edges, the probability that there exists a vertex incident to two new blue edges is at most 
o((logn)~ 2 ). 

Claim 6.8. Assume that some pairs have been exposed and the outcome is typical. Let R be a set of 
pairs of size \R\ = ^(j^^) disjoint to the exposed pairs. Then with probability at least 1 — o((log n)~ 2 ), 

the number of times a pair in R appear among the non-revealed edges of El is at least ^^f" ? an< ^ ^ s 
at most log " , 

2n 

Claim 6.9. Assume that some pairs have been exposed and the outcome is typical. Then with probabil- 
ity at least 1 — o((logn)~ 2 ), for every disjoint sets Ai,A% of vertices satisfying \A±\ < logra (i ^i ogn )i/2 

and \A%\ = ^50^" ; the number of edges between A\ and A2 among the non-revealed edges of El is 
at most ^^oo g " • 

For each vertex w G T, there exists a path P w of length i from vo to w satisfying (i). Let P W) \ be 
the first half and P w2 be the second half of P w . Let Sj+i,o = Sj U Tj, and N = Sj + i 5 o U S~ +1 U Y and 

Qi = {{w, x + ):w£ Ti, x G V(P wA ) \ N}. 

We have Q 1 C Q(T) and 

\Qi\ > \Ti\ ■ C| - 2|Sj| - 2|T| - \Y\ -l)> 
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By (vi) and the definition of N, the pairs in Q\ have both of their endpoints not in Si, thus have 
not been exposed yet. Now expose the set Q\. By Claim HT8l we know that with probability at least 
1 — o((logn)~ 2 ), the outcome is typical, and the number of pairs in Q\ that appear in El is at least 

logn \Tj\ log re 
24n ~ 96 

Condition on this event. Note that if some pair (w,x + ) G Q\ appears in El and is not a blue edge, 
then x can serve as an intermediate point in our next round of rotation. Since we forced not to use the 
same breaking point twice by avoiding the set N (see properties (iii) and (v)), if there is a non-blue 
edge of the form (x, y + ) for some y G P w 2, then we can find a path of length i from vq to y satisfying 
(i) (see Figure [TJ. 
Let 

Si+i.i = {x : (w, x + ) G Qi n El, (w, x + ) is not blue}. 

By Claim [6T71 with probability at least 1 — o((logre) -2 ), among the edges in Qi n El, the number of 
blue edges is at most \TA. Condition on this event. Then the number of non-blue edges between Tj 
and Sf +l 1 is at least q|" ^ > jocf" - By Claim [6T9l with probability at least 1 — o((logn)~ 2 ), 

we see that liSf+i^l > ^'^f" - Redefine Si+^i as an arbitrary subset of it of size exactly ^^f" - Note 
that Si+\ t i H N = 0. The vertices in Si+11 will serve as intermediate points of our rotation. 
Now let 

Q2 = {(x, y + ) '■ x G S i+1>1 , (w, x+) G Qi fl £7 £ , and y G F(P W)2 ) \ (AT U S^)}, 

and note that Q 2 C Q(5'j + i ) i). Further note that we are subtracting SZ 1 1 from V[P w p) m the above 
definition. This is to avoid having both a pair and its reverse in the set Q%. Even though the set Sj+i 1 
was defined as a collection of vertices belonging to P w i 1 for various choices of u>', it can still intersect 
P Wt 2 for some vertex w, since we are considering different paths for different vertices. Similarly as 
before, all the pairs in Q2 are not exposed yet and we have IQ2I > 2 1^+1,1 1- Moreover, with probability 
at least 1 — o((logre) -2 ), the number of pairs in Q2 that appear in El which are not blue edges is at 
least S ' +1 1 'oo 1 ° gn and the outcome is typical. Let Tj+^o = {y : (x,y + ) G Q2 n El, (x,y + )is not blue}. 
As in above, with probability at least 1 — o((log re)~ 2 ), we have jTj+i^l > Moreover, by the 

observation above, for all the vertices y G Tj+i 0, there exists a path of length £ from vq to y satisfying 
«• 

(\ \ 2 A \ 2 (* +1 ) 

Let T i+ i = T i+1>0 and S i+ i = S i+lfi U S i+ i } i. Since \T i+1 \ > f -§§g- J \Ti\ > ( -ggg-J , we may 

2(t+l) 

500"^ 

saw that (i) holds for Tj + i. Property (ii) holds since the set of newly exposed pairs is Q\ U Q2 C 
Q(Si + i t o U Si+i t i) = Q(Si+i). Properties (iii), (v), and (vi) can easily be checked to hold. By Claim 
6.81 the outcome is typical, and we have (vii). For property (iv), the size of Tj + i by definition satisfies 
the bound, and the size of Si+i \ Si is 

\Si+i \ S{\ < \Si + i : o \ Si\ + pi-j-ijil 

, ml iTjllogn /logre\ 2 * logn\ /logre x2 rl 

< TA + 1 ' ° < —I— • 1 + <2' 



redefine Tj + i as an arbitrary subset of it of size exactly ( ^fe? ) . In the previous paragraph we 



500 ~ V 500 J V 500 / - V 500 
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Repeat the above until we reach a set T t of size T , — ; < \TA < ftn „ d " , . By (iv), 

r 1 (log ny -log log n — 1 11 — (500) z log log n J v <" 

we have t = 0( log f ogTt ) and \S t \ < \X\ + 125 log n n log log n ■ Redefine T t as an arbitrary subset of size 
exactly T , ™^ — ; . Note that the size of S* does not necessarily decrease, and thus we still have 

J (log n)^ -log log n 1 •> ' 

\St\ < \X\ + 125 log J! log i ogn - We will repeat the process above for the final time with the sets St and 

T t . This Will give \T t+l \ = (500) ^ og logn aIld \ S *l ^ 125 log n\g log n > fr ° m which 11 foll ° WS that 

l^+i I < i25iogn"o g iogn - Let ^3 = {(«o,z) : z G T m } and expose Q 3 (note that the pairs in Q 3 has 
not yet been exposed since (Tt+l U {^o}) H = 0, while the set of exposed pairs is Q{St+\))- Since 
\Q 3 \ = |T t+1 | = 0( 

log log n ) ' ^ Claims [6]7] and [621 with probability at least 1 o((logn) 2 ), we have 
a pair in Q 3 that appears in El as a non-blue edge. This gives a cycle over the vertices of P whose 
set of blue edges is a subset of that of P. 

For the set Z = (St+i U {^o}) \ X, we see that the set of exposed pairs is a subset of Q(X U Z). 
Furthermore, since Y$ and St+i are disjoint and vq £ Y, the sets Y and Z are disjoint as well. 
By t = 0( lo 1 ° 1 ^ gn ), the total number of events involved is 0( lo 1 gl ^ gw ). Since each event hold with 
probability at least 1 — o((logn)~ 2 ), by taking the union bound, we obtain our set and cycle as 
claimed with probability at least 1 — o((logn) _1 ). □ 

The proofs of Claims 16.71 16.81 and 16.91 follow. 



Proof of Claim \6. 7\ Let G' be the graph induced by the edges that have been revealed before the 
final phase (thus all the edges but El). It suffices to compute the probability of the following events: 
(i) there exist v,w±,W2 € V such that both {v,wi} and {v,W2} appears at least twice among the 
remaining edges, (ii) there exist v,wi,W2 G V such that {v,w±} and {v,W2} were already in G', and 
both appears at least once among the remaining edges, and (iii) there exist v,wi,W2 € V such that 
{v,wx} were already in G', appears at least once among the remaining edges, and {v,W2} appears at 
least twice among the remaining edges. 

The probability of the first event happening is at most 

3 ( nlogn/6\ f 4\ f 2 \ 4 ^ f (logn) 4 



Recall that we conditioned on the event that each vertex has degree at most Clogn in the graph 
induced by the edges revealed before this phase. Consequently, the probability of the second event 
happening is at most 

Clogn\ /nlogn/6\/2\ / 2 \ 2 _ f (log n^ 4 



2 J V 2 J\ l J \ n2 J \ n 

and similarly, the probability of the third event happening is at most 

n2 _ /ClognN _ /nlogn/6\ /3\ _ /^_\ 3 = Q ^(logn^ 4 



1 J \ 3 J \1 J \n 2 J \ n 

Therefore we have our conclusion. □ 

Proof of Claim\KM Recall that at any time of the process, the number of non-revealed edges in El is 
at least n *°| - . The probability of a single non-revealed edge of El being in R is at least Jj^L Therefore 
the expected number of times a pair in R appear among the non-revealed edges is at least, 

\R\ nlogn |i?|logn 
^2" ' 12 ~ 12n 
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On the other hand, recall that at any time of the process, the probability that a non-revealed edge of 
El is some fixed pair at most \, since the number of non-exposed pairs is at least Therefore the 
expected number of times a pair in R appear among the non-revealed edges is at most, 

2\R\ nlogn |i?|logn 
n 2 6 3n 

Since \R\ = f2( log ™ n ), the conclusion follows from Chernoff's inequality and union bound. □ 

Proof of Claim UTM Recall that at any time of the process, the probability that a non-revealed edge 
of El is (v,w) or (w,v) is at most since the number of non-exposed pairs is at least 

Let k be a fixed integer satisfying k < logw . 1 " glogn - Let A\ be a set of vertices of size k and A2 be 
a set of vertices of size kl ^f n n disjoint from A%. The number of choices for such sets is at most 



n 



ml 500en\ fclogn/500 / e 1000 n\ fclo s n / 500 



n k \ " I < I n lo s" • """""" I 
Klogn/500/ \ klogn J 



k log n 



The probability of there being more than fc ^| n edges between A\ and Ai can be computing by first 
choosing kl °Q Q n pairs between A\ and A2, and then computing the probability that they all appear 
among the remaining edges. Thus is at most 



,2i /rnn\ / 1 \ fclogn/100 / a \ fclogn/100 

k log n J 500 \ ( n\ogn\ 6 ' / 4 N 
felogn/100y ' V 3 J [n? 



ek nlogn 4 



5 3 



fclogn/100 



fclogn/100 \ fclogn/100 



Aeklogn\ g ' f klogn 



15n y \ n 

Thus by taking the union bound, we see that the probability of there being such sets A\ and A2 is at 
most 

^/(logr^loglogn) /g i 00 n ^felogn/500 ^ , x fclogn/100 n/(logn loglogn) ^00^4 log 4 n \ * logn/500 



E 



< 



, log n I \ n / ' 

fc=i v 6 7 v 7 fc=i 

Since the summand is maximized at k = 1 in the range 1 < < 1 t^-t , we see that the right 

— — log n-log log n ' 

hand side of above is o((logn) -2 ). □ 

We now can find a Hamilton cycle without any blue edges, and conclude the proof that OrientPrime 
succeeds whp. 

Theorem 6.10. There exists a Hamilton cycle with no blue edges whp. 

Proof. By Proposition 15.21 we whp can find a 1-factor, which by Claims 16.31 and 16.41 contains at most 
logn blue edges that are vertex-disjoint. By Claim HT2l it suffices to find a Hamilton cycle after 
compressing the vertices in B from the 1-factor, since whp there are no blue edges incident to B. 
With slight abuse of notation, we may assume that the compressed graph contains n vertices, and 
that we are given at least wl o gn random edges over this 1-factor. By Lemma [631 by using half of these 
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random edges, we can find a 1-factor consisting of cycles Co, Ci, ■ ■ ■ , C so that |C<j| = n — o(n) and 
t = O(logn). Suppose that there are k blue edges that belong to the 1-factor, for some k = O(logn). 
We still have a set of at least wl ° gn non-revealed edges El that we are going to use in Lemma 16.61 

Let X be a set which we will update throughout the process. Consider the cycle C\. If it 
contains a blue edge, then remove it from the cycle to obtain a path P\. Otherwise, remove an 
arbitrary edge from C\ to obtain Pi = (wo,w\,--- ,w a ). Expose the set of pairs {(w a ,x) : x € 
V(Cq), (w a ,x) is not exposed} which is of size at least |Co| — |X| — 2k = n — o(n). By Claims 16.71 
and !6.8| with probability at least 1 — o((logn) -2 ), the outcome is typical and there exists at least one 
non-blue edge of the form (w a ,x) for some x £ V{C$). Condition on this event. Note that the set 
of exposed pairs is a subset of Q({w a }), and that this gives a path P over the vertices of Co and Pi, 
which starts at wo and ends at some vertex in Ci (thus w a is not a endpoint). Add w a to the set X, 
and let Y\ be the set of vertices incident to some blue edge that belongs to Co or Pi. Note that X, Y\ 
are disjoint, the set of exposed pairs is a subset of Q(X), and neither of the two endpoints of P belong 
to X U Y\. By applying Lemma 16.61 with X and Y = Y±, with probability at least 1 — o((log we 
obtain a cycle that contains all the vertices of Co and Ci. Moreover, the pairs we further exposed will 
be a subset of Q{Z\) for some set Z\ of size at most log - t " g log - . Condition on this event and update 
X as the union of itself with Z\ . Note that by the definition of Y\ , X does not intersect any blue edge 
of the new cycle. 

Repeat the above for cycles C2, C3, •• • ,C. At each step, the success probability is 1 — o((logn) _1 ), 
and the size of X increases by at most 1 + logn .io g i og n - logniogiogn - Since 1 = °( lo Sn), we can 
maintain X to have size o(n), and thus the process above indeed can be repeated. In the end, by the 
union bound, with probability 1 — o(l), we find a Hamiltonian cycle which has at most k blue edges. 
Let Y be the vertices incident to the blue edges that belong to this Hamilton cycle. Note that \Y\ < 2k 
and X n Y = 0. Remove one of the blue edges (y, z) from the cycle to obtain a Hamilton path. Apply 
Lemma 16.61 with the sets X and Y \ {y,z} to obtain another Hamilton cycle with fewer blue edges. 
Since the total number of blue edges is at most k = O(logn), the blue edges are vertex-disjoint, and 
the probability of success is at least 1 — o((logn) _1 ), after repeating this argument for all the blue 
edges in the original cycle, we obtain a Hamilton cycle with no blue edge. □ 

7 Concluding Remarks 

In this paper we considered the following natural question. Consider a random edge process where at 
each time t a random edge (u, v) arrives. We are to give an on-line orientation to each edge at the 
time of its arrival. At what time t* can one make the resulting directed graph Hamiltonian? The best 
that one can hope for is to have a Hamilton cycle when the last vertex of degree one disappears, and 
we prove that this is indeed achievable whp. 

The main technical difficulty in the proof arose from the existence of bud vertices. These were 
degree-two vertices that were adjacent to a saturated vertex in the auxiliary graph D§—in,5—out' 

Note 

that for our proof, we used the method of deferred decisions, not exposing the end-points of certain 
edges and leaving them as random variables. Bud vertices precluded us from doing this naively and 
forced us to expose the end-point of some of the edges which we wanted to keep unexposed (it is not 
difficult to show that without exposing these endpoints, we cannot guarantee the bud vertices to have 
degree at least 2). If one is willing to settle for an asymptotically tight upper bound on t*, then one 
can choose t* = (1 + e)nlogn/2, and then for n = n{s) sufficiently large there are no bud vertices. 



32 



Moreover, since for this range of t*, the vertices will have significantly larger degree, the orienting rule 
can also be simplified. While not making the analysis "trivial" (i.e., an immediate consequence of the 
work in [H]), this will considerably simplify the proof. 

Acknowledgement. We are grateful to Alan Frieze for generously sharing this problem with us, and 
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